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cache con^ol structures, A cache is marked as qualified when full or there is no more data to be written to the cache, allowing burst data 
to be transferred out of said cache. The invention has applicaticms in network adapter caids for transferring data between a host system bus 
and a neiworic where the bus and network operate at different speeds and are therefore part of different clock domains 
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QUALIFIED BURST BUFFER 

5 

The present Invention relates to comnunicatlon of 
digital information. More particularly, the invention relates 
to using one or more qualified burst caches to improve the 
data transfer between two clock domains where at least one of 
10 the clock domains supports burst data transfers, such as a 
host computer bus that can support burst transfers and a 
network. 

BACKGROUND OF THE INVENTION . 

15 The present invention is best understood in the 

context of a networic adapter. A network adapter is a device 
well known in the art for connecting a digital computing 
device, such as a personal computer, to a communications 
network such as a local area nel^rk, or LAN. 

20 One type of network adapter is discussed in 

coassigned U.S. Patent Number 5,412,782, which is incorporated 
herein by reference. As described in that application, a 
network adapter generally connects to a multipurpose host 
computer data bus through programmed I/O (PIO) , possibly with 

25 a direct memory access (DMA) mode available as a backup for 
receive operations. The described adapter generally was 
designed to operate on a network standard known as lOBaseT 
Ethernet, which operates at maximum network data speeds 
approaching 10 Megabits per second (Mbps) . 

30 Various aspects of other types of network adapters, 

some of which may be considered modifications or improvements 
of the just discussed adapter, are described in coassigned 
U.S. application s/n 08/296,577 filed 08/26/94, which 
discusses an adapter for operating at 100 Mbps (100BaseT), 

35 coassigned U.S. application s/n 08/544,745, which discusses an 
adapter connector for operating at both lOBaseT and lOOBaseT, 

and coassigned U.S. application s/n filed April 30, 

1996 and entitled PACKET FILTERING BASED ON SOCKET OR 
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APPLICATION IDENTIFICATION. Each of these pending related 
U.S. applications are incorporated herein by reference. 

In the past several years the capabilities and speed 
of typical host computer systems placed on networks have 
5 improved dramatically and the demands placed on those systems 
have increased. The speed of networks has also increased. 
There is therefore a need for network adapters to evolve to 
handle higher data speeds and to more effectively operate on a 
host's system bus. 

One modification made to high performance adapters 
is to modify how those adapters operate on the system bus. 
Higher performance adapters today can often act as bus master 
devices, rather than as PIO bus devices. Bus master devices 
can generally transfer more data while using less time on the 

15 system bus than can PIG devices. 

Today, high performance system buses, such as EISA, 
and PCI generally allow bus master devices to operate in a 
burst jnode. In burst mode, a bus master is allowed to 
transfer a high amount of data ^r the bus, with one bus word 

20 (up to four bytes) being transferred every bus cycle without 
interruption. This can provide the most efficient use of bus 
resources • 

A problem network adapters have, however, in fully 
utilizing a burst mode on a system bus is that networks 

25 operate at a effective data speed that is generally much less 
than the system bus transmission speed. Therefore, a "burst 
mode" read operation on a bus will quickly drain any data 
received from the network on the adapter, and the network 
cannot deliver data to the adapter quickly enough to meet the 

30 burst mode requirements. For this reason, when moving a block 
of data between a burst mode host computer bus and a network, 
it is necessary to buffer data on the adapter to obtain the 
optimal throughput performance. 

This problem can be thought of in generalized terms 

35 as arising whenever data is exchanged between two different 
computer systems operating at different speeds. These two 
areas of the computing environment can be referred to 
generally as two different clock dcxnaxns. Referring to the 
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example just discussed, the host bus can be thought of as 
representing one clock domain and the network and the adapter 
interface circuitry as another. 

There are two conmon approaches for buffering data 
5 when transferring it between two clock domains: a FIFO. (First- 
In-*First-Out) memory or a high speed Mual-porf RAM (Random 
Access Memory) , either of which be accessed by either clock 
domain in a random manner. 

FIFOs are a well understood memory structure 
10 consisting of memory locations and read and write pointers 
that address one of the possible locations within the FIFO. 
When using a FIFO memory, as data is put into the FIFO, some ^ 
^ incremental delay later it becomes available to be read out of 
the FIFO. FIFOs generally require a large number of circuit 
15 gates to implement because data must be able to be read from 
or written to the FIFO starting at any location in the memory. 
This requires a complex pointer and control structure that is 
difficult to design. A major part of a FIFO design entails 
always keeping track of the relationship between the read and 
20 write pointers and ensuring that there is either room for data 
to be written into the FIFO, or that data being read from the 
FIFO is valid. 

The complexities of FIFO design and control are 
increased when an adapter interfaces to a host bus which 
25 supports a burst mode, especially when the bus master cannot 
pace the transfers, as is the case with EISA - Extended 
Industry Standard Architecture. This is because the EISA 
Specification does not allow the bus master to insert "wait" 
cycles on the bus once a burst transfer has begun and it is 
30 complex to update and synchronize the FIFO pointer updates, at 
the fast burst rate demanded by the bus. 

An alternative prior art approach to using a FIFO is 
using a "dual^port" RAM, but this also has disadvantages. A 
Dual-port RAM is merely a more general implementation of the 
35 above FIFO and therefore suffers from the same read write 
pointer, data valid complexities. 

What is needed is a mechanism that will allow an 
adapter or other device moving data between two clock domains 
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to efficiently transmit data in a burst mode on the faster 
clock domain without the design and control complexities of 
prior art solutions. 

5 SUMMARY OF THE INVENTION 

The present invention entails using one or more 
qualified Jburst caches (QBCs) in transferring data. Each QBC 
is an area of memory of a size N that can be accessed by 
either side of a transfer connection, but not by both sides 
10 simultaneously. 

The invention differs from the Mual-port' RAH 
implementation in that each QBC must be qualified before any 
data is transmitted out of it. When qualified, all the data 
written to a given QBC is marked as valid and the QBC can be 
15 read in burst mode in its entirety. Being qualified indicates 
that either the QBC is full or that no more data will be 
written to the QBC by the transmitting side because the 
transmitting side has no more data to send because the entire 
data block has been transmitted .-^-i 
20 In the current invention, the complexity of 

maintaining the read/write pointers and the data valid issue 
is eliminated. A two bit handshake per QBC is used to 
indicate when a QBC is valid to be read, or when it is empty 
and can be written. In the case where a QBC can be read and 
25 written from both directions, these two bits will have 

different meaning depending on the direction in which data is 
passing through the QBC. 

While the invention may be understood in terms of 
using just one QBC between two different clock domains, much 
30 greater performance advantage can be achieved by using at 
least two independent QBCs, either of which can be accessed 
independently of the other by the different clock domains. 

BRIEF DESCRIPTION OF THE DRAWINGS 

35 

Fig. 1 is a block diagram of an advanced network 
adapter in which the present invention may be effectively 
employed. 
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Fig. 2 is simplified block diagram of a host 
interface using one QBC in accordance with the invention. 

Fig. 3 is simplified block diagram of a host 
interface using more than one QBC in accordance with the 
5 invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
Fig. 1 illustrates one example of an advanced 
network adapter in which the present invention may be 

10 effectively employed. This example is intended to be 
illustrative and not limiting. 

Adapter 10, shown in Fig. 1, comprises a number of 
components coupled between a host computer system bus 20 and 
communications network media 30. 

15 At the host side, host interface circuitry 200 

provides circuitry necessary for effectively operating on the 
host's system bus 20, According to the invention, this 
circuitry includes QBCs as shown in Figure 2. Interface 200 
contains all of the control cirottitry necessary to operate 

20 effectively as a bus master on £us 20, as is described in more 
detail below. 

Interface circuitry 200 on the adapter side connects 
to receive First In First Out (FIFO) circuitry 160 and to a 
transmit (TX) RAM FIFO 190. These RAMS provide one level of 

25 packet data buffering on the adapter, and are designed to 
receive and buffer packets at essentially network speeds. 
These on-adapter FIFOs may also both be coupled to an off or 
on-chip additional RAM FIFO 190 which provides further data 
buffering. Receive control circuitry 130 and transmit control 

30 circuitry 140 are both coupled to ethernet controller 150 
which formats data appropriately for transmission on a 
ethernet-type network. 

The example adapter shown in Figure 1 also includes 
two sets of encoder/decoder circuits and two transceivers, 

35 allowing the adapter to operate at two different network 
speeds. As an example, decoder circuitry 115 and encoder 
circuitry 120 connect with transceiver 100 and Attachment Unit 
Interface (AUI) 110 to allow for transmissions at lOBaseT 
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ethernet and decoder circuitry 117 and encoder circuitry 122 
connect with transceiver 265 to allow for transmissions at 
100BaseT ethernet. 

A subset of the circuitry discussed thus far is 
5 shown in Fig. 1 enclosed by a dotted line to indicate that 
this circuitry may all be contained within a single 
Application Specific Integrated Circuit (ASIC) 210. ASIC 210 
also employs an EEPROM 220 coupled to host interface 200 to 
provide configuration data such as station address, and it may 

10 also optionally employ a boot PROM 230 for automatic network 
configuration. Also contained within and employed throughout 
ASIC 210 are clocks 240 and network management circuitry 250 
to manage various network status signals. A voltage 
controlled oscillator (VCO) 270 is also coupled to decoder 115 

15 and encoder 120. Those circuit units included in ASIC 210 are 
included as examples, and different configurations of ASIC 210 
including different sets of the components shown in Pig. 1 are 
possible. 

In one example of an a[a'apter according to the 
20 invention, data transfer operations between adapter 10 and the 

host are performed preferably through adapter 10 acting as a 

bus master on host bus 20. 

One complication of the function of adapter 10 is 

that the data on bus 30 is generally transferred at an 
25 effective data rate of up to 100 Mbps, while the effective 

data rate on system bus 20 can be as high as 1,056 Mbps. As 

discussed above, these two different speeds of operation may 

be referred to as two clock drains. 

In the circuit shown in Fig. 1, the two clock 
30 domains exist at either side of host interface 200. Clock 

Domain 1 (CDl) includes the host system bus and bus interface 

circuitry within host interface 200. Clock Domain 2 (CD2) 

includes the other adapter components shown in Fig. 1 and the 

network 30. 

35 According to the invention, a qualified burst cache 

mechanism resides within host interface 200 and provides a 
means for transferring burst data between the two clock 
domains, CDl and CD2. Using the QBC, the invention allows for 
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efficient use of the high speed system bus 20 in burst mode 
even when data is being transmitted as much slower speeds 
within the network clock domain. 

Pig. 2 is a block diagram illustrating burst data 
5 transmission between CDl and CD2 using a QBC 210 according to 
the invention. According to the invention, interface 200 
maintains two state machines to control QBC 210, state machine 
one (SMI) 220 controls reads and writes to QBC 210 from CDl, 
and state machine two (SM2) 230 controls reads and writes to 

10 QBC 210 from CD2. 

QBC 210, according to one embodiment of the 
invention, is controlled by two 1-bit handshaking signals, BDl 
(bit domain 1) and BD2 (bit domain 2) and the state of the 
direction of data transfer, which is determined by other 

15 circuitry on the adapter card and is known to SMI and SM2. 

These three states indicate whether the cache is being written 
or read by a particular clock domain and when it is ready for 
access by the other domain. 

The operation of QBC 2=10 with respect to these three 

20 states in accordance with one embodiment of the invention is 
illustrated in the following table. 



Table 1 



25 


CDl — >CD2 


BDl 


BD2 






1 


0 


0 


QBC is empty and 










nay be filled by CDl 




1 


0 


1 


CD2 is reading data from QBC 


30 


1 


1 


0 


QBC is full and valid, waiting to be 








read by CD2 




0 


0 


0 


QBC is empty and 










may be filled by CD2 




0 


1 


0 


CDl is reading data from QBC 


35 


0 


0 


1 


QBC is full and valid, waiting to be 








read by CDl 



The operation of the described circuits in 



40 accordance with the signals shown in Table 1 is as follows. 
In the case where data is moving from CDl to CD2, i.e. CDl— ■ 
>CD2 « 1, BDl and BD2 both being 0 indicates that the QBC is 
idle and may be written to at any time by CDl. Once CDl has 
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filled QBC 210, it sets BDl to 1. This informs SM2 that QBC 
210 contains valid data and can be read by CD2 at any time. 

When CD2 begins reading data from QBC 210, it sets 
BDl to 0 and sets BD2 to 1. When CD2 has finished reading 
5 data from QBC 210, SH2 deasserts B02. This is the interlock 
which tells SMI that QBC 210 is again available to accept 
data. No data can be written to a given QBC if it is 
currently being read out of by the other clock domain. 

According to a further embodiment, a QBC according 
10 to the invention is able to efficiently handle a situation 
where the QBC is not full but all the data to be transferred 
has been entered into the QBC and the transfer is therefore 
complete. In this embodiment, the state machines SMI and SM2 
are enabled to receive a signal XFERCNT(N) which specifies the 
15 number of valid bytes present inTthe QBC. This signal is 
presented from the writing SM to the reading SM at the same 
time that reading is enabled. In one specific example, this 
signal is transmitted as a parallel word wide enough to 
indicate the entire QBC. With a QBC of 128 bits, for example, 
20 XFERCNT{N) would by a seven bit wide signal. 

Figure 2 shows additional control signals used 
by host interface 200. In a specific embodiment, these 
signals as well may include parallel bit signals. These 
signals have the following functions: 
25 RXLEVEL - used by RX FIFO to indicate how much data it has. 

RXWAIT - used by RX FIFO to indicate that it is momentarily 

unable to supply RX data on DATA. This occurs when 
data is being fetched from the FIFO RAM. RXWAIT is 
independent of RXLEVEL. 



wo 97/41514 



PCTAJS97/07167 



RXCOMPLETE - used by RX FIFO to indicate that the current 
iW packet has been fully received into the 
FIFO RAM. The logic in (200) uses RXCOMPLETE 
and RXLEVEL « 0 to determine when all of the 
5 data for an RX packet has been moved out of RX 

FIFO* 

RXRD read strobe use by (200) to indicate that it has 

taken some data from RX FIFO on DATA. RX FIFO uses 
this signal to advance the FIFO to the next datum. 
10 (200) shall not assert RXRD when RXWAIT is true, nor 

when RXLEVEL » 0. 

TXFULL - used by TX FIFO to indicate that it has no more 

space available. 

TXWAIT - used by TX FIFO to indicate that momentarily it is 

15 unable to accept^ data on DATA. This occurs when TX 

FIFO is busy storing the last datum to the FIFO RAM. 
TXWR - used by (200) to write the data on DATA to the TX 

FIFO. TXWR shall not be asserted when TXWAIT is 

true, nor when TXFULL is true. 

20 Multiple OBCs 

As suggested above, the performance improvement of 

a QBC according to the invention can be greatly enhanced when 

two or more memory caches in a QBC are used to facilitate . 

burst data transfers between two clock domains. Fig. 2 
25 illustrates a host interface 200 containing two QBCs 210 and 

215. In this embodiment, QBCl can be read by CD2, while QBC2 

is being filled by CDl. 

Where there are two QBCs, a total of four handshake 

signal bits are used to control reads anC writes to the QBCs, 
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two for QBCl 210 and two for QBC2 215. In addition, the state 
machines keep track of which was the last QBC either read from 
or written to and force the QBCs to be accessed sequentially 
for both reads and writes. Each QBC is essentially controlled 
5 independently according to the signals shown in Table 1. 

According to the embodiment shown in Figure 3, when 
data is being written from CDl to CD2, CDl first fills QBCl, 
then CDl fills QBC2, then CDl examines QBCl to see if the data 
placed there has been read yet and if QBCl is empty. If it 

10 is, SMI continues writing data. If QBCl is not empty, SMI 
halts writing data and waits for SM2 to deassert signal 1BD2. 

When there are multiple QBCs, in order to 
efficiently handle the situation where a QBC is not full but 
all the data to be transferred has been entered into the QBC 

15 and the transfer is therefore cemplete, SMI and SM2 are 
enabled to send and receive a signal xrERCNT(N) which 
specifies the number of valid bytes present in the QBC, as 
well as signals QBCIIAST, QBC2IAST indicating which of the 
QBCs hold the last of the data to be transferred. These 

20 signals are presented from the writing SM to the reading SM at 
the same time that reading is enabled. 

Alternative Embodiments 

The present invention is not limited to 
25 transferring data between a host and a network, but can be 
generalized to data transmissions between any two separate 
clock domains or any two logic functions which exist in 
separate clock domains. 

The present invention is also not limi'wed to just 
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one or two alternating burst caches, but the techniques of the 
invention may expanded to any number of burst caches, each 
having its own 2*bit handshake. In this embodiment, QBCs 
would be written and read sequentially from cache 1 to N and 
5 back to 1 again based on whether the next cache is available 
to be written or is valid. 

The invention has now been explained with reference 
to specific embodiments. Other embodiments will be apparent 
to those of skill in the art. In particular, method steps 

10 have been grouped and labeled as being part of various 

sub-methods in order to increase clarity of the disclosure, 
however, these steps could be differently grouped without 
changing the essential operation of the invention. It is 
therefore not intended that this invention be limited, except 

15 as indicated by the appended claims. 
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WHAT TS CLAIMED IS; 

1 1. A qualified burst cache for transferring 

2 blocks of data between a first clock domain and a second clock 

3 domain comprising: 

4 a) a first memory cache for storing a block of data; 

5 b) a first state machine associated with said first 

6 clock domain for determining when said first memory cache may 

7 be read to or written from by said first clock domain and for 

8 performing lock out? and 

9 c) a second state machine associated with said second 

10 clock domain for determining when said second memory cache may 

11 be read to or written from by said first clock domain and for 

12 performing lock out. 

1 2. The device according to claim 1 further 

2 comprising hand shaking signals for communicating between said 

3 first state machine and said second state machine. 

1 3. The device according to claim 1, further 

2 comprising a second memory cache for storing a second block of 

3 data simultaneously to data being read from or written to said 

4 first memory cache. 

1 4. The device according to claim 1, further 

2 comprising a plurality of second memory caches for storing a 

3 plurality of second blocks of data simultaneously to data 

4 being read from or written to a different memory cache. 



1 



5. The device according to claim 3 wherein each 
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2 memory cache is controlled by two handshake signals and by the 

3 state of the direction of data transfer* 



1 6. The device according to claim 2 wherein at 

2 least one state machine communicates to another state machine 

3 a signal indicating an amount of data in said qualified memory 

4 cache . 

1 7. A method for transferring data between a first 

2 clock domain and a second clock domain comprising: 

3 storing data in a memory cache until said memory cache 

4 is full or until there is no more data to be received; 

5 marking said memory cache as qualified; and 

6 transferring data from said memory cache in a burst mode 

7 until said memory cache is empty". 



1 8. A data communications adapter apparatus for 

2 coupling a host computer to a computer network employing 

3 communications media, the adapter comprising: 

4 a. a transceiver coupled to receive and transmit 

5 data over the media; 

6 b. a transmit data buffer; 

7 c, data transmit control circuitry coupled to said 

8 transceiver, to said transmit data buffer, and to said host 

9 computer, for generating a packet transmit signal causing said 

10 transceiver to begin transmitting data from said transmit data 

11 buffer over said communications media; 

12 d. a receive data buffer; 

13 e. data receive control circuitry coupled to said 
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transceiver, to said receive data buffer, and to said host 
computer, for storing data received by said transceiver in 
said receive data buffer; 

f . host interface circuitry; and 

g. a qualified burst cache for receiving and 
transmitting data on said host computer's system bus in burst 

20 mode and for receiving and transmitting data on said adaptor 

21 at computer network speeds* 

1 9. The adapter of claim 8, further comprising: 

2 h. ethernet control circuitry; and 

3 wherein said ethernet control circuitry, said host 

4 interface circuitry, said (jualified burst cache, said data 

5 receive control circuitry, said data transmit control 

6 circuitry, said receive data bufler and said transmit data 

7 buffer are all contained in a single Application Specific 

8 Integrated Circuit (ASIC) • 

1 10. The adapter of claim 8, wherein said data 

2 receive control circuitry is programmable. 

1 11. The adapter of claim 8, further comprising 

2 first and second state machines to control data access to said 

3 qualified burst cache. 

1 12. The adapter of claim 11, wherein said state 

2 machines communicate via three handshaking signals, two 

3 signals indicating whether said cache is being accessed by a 

4 either said host or said network, and one indicating a number 
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5 Of valid bytes of data in a qualified cache. 
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1 13, A method of transferring a block of data from 

2 a first clock domain to a second clock domain, said method 

3 comprising the steps of: 

4 i. placing data in an idle qualified burst cache by 

5 said first clock domain until said cache is full or until 

6 there is no more data to be placed into said cache; 

7 j. thereupon asserting a first handshaking signal 

8 indicating to said second clock domain that data in said cache 

9 is valid and may be read by said second clock domain; 

10 k. thereafter, said second clock domain asserting a 

11 second handshaking signal indicating to said first clock 

12 domain that said first clock domain is locked out from writing 

13 to said qualified burst cache vKlle data is being read by said 

14 second clock domain; and 

15 1. thereafter, said second clock domain deasserting 

16 said second handshaking signal indicating to said first clock 

17 domain that said cache is idle and nay again be filled by said 

18 first clock domain. 

1 14. The method according to claim 13 wherein said 

2 first clock domain represents a host system bus and said 

3 second clock domain represents a network. 

1 15. The method according to claim 13 wherein said 

2 first clock domain represents a network and said second clock 

3 domain represents a host system bus. 
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